

- 51 - 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: SOLVAY (Societe Anonyme) 
5 (B) STREET: rue du Prince Albert, 33 

(C) CITY: Brussels 

(E) COUNTRY: Belgium 

(F) POSTAL CODE: 1050 

(G) TELEPHONE: (02) 509*61.11 

10 (ii) TITLE OF INVENTION: Xylanase, microorganisms 

producing it, DNA molecule [sic] , methods for preparing 
this xylanase and uses of the latter 

(iii) NUMBER OF SEQUENCES: 2 9 

(iv) COMPUTER READABLE FORM: 

15 (A) MEDIUM TYPE: Floppy disk 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS: MS-DOS 



(2) INFORMATION FOR SEQ ID NO:l: 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 



CAAATCGTCA CCGACAATTC CATTGGCAAC CACGATGGCT ATGATTATGA ATTTTGGAAA 



60 
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GATAGCGGTG GCTCTGGGAC AATGATTCTC AATCATGGCG GTACGTTCAG TGCCCAATGG 
AACAATGTTA ACAACATATT ATTCCGTAAA GGTAAAAAAT TCAATGAAAC ACAAACACAC 
CAACAAGTTG GTAACATGTC CATAAACTAC GGAGCCAACT TCCAACCAAA TGGTAATGCG 
TATTTATGCG TCTATGGTTG GACTGTTGAC CCTCTTGTCG AATATTATAT TGTCGACAGT 
TGGGGCAACT GGCGTCCACC AGGAGCAACG CCTAAGGGGA CCATCACTGT TGATGGAGGA 
ACATATGATA TCTACGAGAC TCTTAGAGTC AATCAACCCT CCATTAAGGG GATTGC CACA 
TTTAAACAAT ATTGGAGTGT TCGAAGATCG AAACGCACGA GTGGCACGAT TTCTGTCAGC 
AAC CACTTTA GAGCGTGGGA AAACTTAGGG ATGAATATGG GGAAAATGTA TGAAGTCGCG 
CTTACTGTAG AAGGCTATCA AAGTAGCGGA AGTGCTAATG TATATAGCAA TACAC TAAGA 
ATTAACGGTA ACCCTCTCTC AACTATTAGT AATGACGAGA GCATAACTTT GGATAAAAAC 
AAT 

(2) INFORMATION FOR SEQ ID NO: 2: 



12C 

180 

240 

300 

36C 

420 

480 

540 

600 

660 

663 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: genomic DNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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CAA ATC GTC ACC GAC AAT TCC ATT GGC AAC CAC GAT GGC TAT GAT TAT 4 8 

Gin lie Val Thr Asp Asn Ser lie Gly Asn His Asp Gly Tyr Asp Tyr 
15 10 15 

GAA TTT TGG AAA GAT AGC GGT GGC TCT GGG ACA ATG ATT CTC AAT CAT 9 6 

Glu Phe Trp Lys Asp Ser Gly Gly Ser Gly Thr Met He Leu Asn His 
20 25 30 

GGC GGT ACG TTC AGT GCC CAA TGG AAC AAT GTT AAC AAC ATA TTA TTC 144 
Gly Gly Thr Phe Ser Ala Gin Trp Asn Asn Val Asn Asn He Leu Phe 
35 40 45 

CGT AAA GGT AAA AAA TTC AAT GAA ACA CAA ACA CAC CAA CAA GTT GGT 192 
Arg Lys Gly Lys Lys Phe Asn Glu Thr Gin Thr His Gin Gin Val Gly 
50 55 60 

AAC ATG TCC ATA AAC TAC GGA GCC AAC TTC CAA CCA AAT GGT AAT GCG 24 0 

Asn Met Ser He Asn Tyr Gly Ala Asn Phe Gin Pro Asn Gly Asn Ala 
65 70 75 80 

TAT TTA TGC GTC TAT GGT TGG ACT GTT GAC CCT CTT GTC GAA TAT TAT 2 88 

Tyr Leu Cys Val Tyr Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr 
85 90 95 

ATT GTC GAC AGT TGG GGC AAC TGG CGT CCA CCA GGA GCA ACG CCT AAG 33 6 

He Val Asp Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys 
100 105 110 

GGG ACC ATC ACT GTT GAT GGA GGA ACA TAT GAT ATC TAC GAG ACT CTT 3 84 

Gly Thr He Thr Val Asp Gly Gly Thr Tyr Asp He Tyr Glu Thr Leu 
115 120 125 

AGA GTC AAT CAA CCC TCC ATT AAG GGG ATT GCC ACA TTT AAA CAA TAT 432 
Arg Val Asn Gin Pro Ser He Lys Gly He Ala Thr Phe Lys Gin Tyr 
130 135 140 
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TGG AGT GTT CGA AGA TCG AAA CGC ACG AGT GGC ACG ATT TCT GTC AGC 48 G 

Trp Ser Val Arg Arg Ser Lys Arg Thr Ser Gly Thr lie Ser Val Ser 

145 150 155 160 



AAC CAC TTT AGA GCG TGG GAA AAC TTA GGG ATG AAT ATG GGG AAA ATG 52 8 

Asn His Phe Arg Ala Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met 
165 170 175 



TAT GAA GTC GCG CTT ACT GTA GAA GGC TAT CAA AGT AGC GGA AGT GCT 576 
Tyr Glu Val Ala Leu Thr Val Glu Gly Tyr Gin Ser Ser Gly Ser Ala 
180 185 190 



AAT GTA TAT AGC AAT ACA CTA AGA ATT AAC GGT AAC CCT CTC TCA ACT 624 
Asn Val Tyr Ser Asn Thr Leu Arg lie Asn Gly Asn Pro Leu Ser Thr 
195 200 205 



ATT AGT AAT GAC GAG AGC ATA ACT TTG GAT AAA AAC AAT 663 
lie Ser Asn Asp Glu Ser lie Thr Leu Asp Lys Asn Asn 
210 215 220 

(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE : amino acid 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal fragment 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus 
10 (B) STRAIN: 720/1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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Gin lie Val Thr Asp Asn Ser lie Gly Asn His Asp Gly Tyr Asp Tyr 
15 10 15 

Glu Phe Trp Lys Asp Ser Gly Gly Ser Gly Thr Met lie Leu Asn His 

20 25 30 

Gly Gly Thr Phe Ser Ala Gin Trp Asn Asn Val Asn Asn lie Leu Phe 
35 40 45 

Arg Lys Gly Lys Lys Phe Asn Glu Thr Gin Thr His Gin Gin Val Gly 
50 55 60 

Asn Met Ser lie Asn Tyr Gly Ala Asn Phe Gin Pro Asn Gly Asn Ala 

65 70 75 80 

Tyr Leu Cys Val Tyr Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr 
85 90 95 

lie Val Asp Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys 
100 105 110 

Gly Thr lie Thr Val Asp Gly Gly Thr Tyr Asp lie Tyr Glu Thr Leu 
115 120 125 

Arg Val Asn Gin Pro Ser lie Lys Gly lie Ala Thr Phe Lys Gin Tyr 
130 135 140 

Trp Ser Val Arg Arg Ser Lys Arg Thr Ser Gly Thr lie Ser Val Ser 
145 150 155 160 

Asn His Phe Arg Ala Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met 
165 170 175 



Tyr Glu Val Ala Leu Thr Val Glu Gly Tyr Gin Ser Ser Gly Ser Ala 
180 185 190 
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Asn Val Tyr Ser Asn Thr Leu Arg lie Asn Gly Asn Pro Leu Ser Thr 
195 200 205 

lie Ser Asn Asp Glu Ser lie Thr Leu Asp Lys Asn Asn 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

ATGAGACAAA AGAAATTGAC GTTGATTTTA GCCTTTTTAG TTTGTTTTGC ACTAACCTTA 60 

CCTGCAGAAA TAATTCAGGC ACAAATCGTC ACCGACAATT CCATTGGCAA CCACGATGGC 120 

TATGATTATG AATTTTGGAA AGATAGCGGT GGCTCTGGGA CAATGATTCT CAATCATGGC 180 

GGTACGTTCA GTGCCCAATG GAACAATGTT AACAACATAT TATTC CGTAA AGGTAAAAAA 24 0 

TTCAATGAAA CACAAACACA CCAACAAGTT GGTAACATGT CCATAAACTA CGGAGCCAAC 3 00 

TTCCAACCAA ATGGTAATGC GTATTTATGC GTCTATGGTT GGACTGTTGA CCCTCTTGTC 360 

GAATATTATA TTGTCGACAG TTGGGGCAAC TGGCGTCCAC CAGGAGCAAC GCCTAAGGGG 420 

ACCATCACTG TTGATGGAGG AACATATGAT ATCTACGAGA CTCTTAGAGT CAATCAACCC 480 

TCCATTAAGG GGATTGCCAC ATTTAAACAA TATTGGAGTG TTCGAAGATC GAAACGCACG 54 0 

AGTGGCACGA TTTCTGTCAG CAACCACTTT AGAGCGTGGG AAAACTTAGG GATGAATATG 600 
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GGGAAAATGT ATGAAGTCGC GCTTACTGTA GAAGGCTATC AAAGTAGCGG AAGTGCTAAT 66C 

GTATATAGCA ATACACTAAG AATTAACGGT AACCCTCTCT CAACTATTAG TAATGACGAG 72 0 

AGCATAACTT TGGATAAAAA CAAT 744 
(2) INFORMATION FOR SEQ ID NO; 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATG AGA CAA AAG AAA TTG ACG TTG ATT TTA GCC TTT TTA GTT TGT TTT 48 
Met Arg Gin Lys Lys Leu Thr Leu lie Leu Ala Phe Leu Val Cys Phe 
-25 -20 -15 

GCA CTA ACC TTA CCT GCA GAA ATA ATT CAG GCA CAA ATC GTC ACC GAC 96 
Ala Leu Thr Leu Pro Ala Glu lie lie Gin Ala Gin lie Val Thr Asp 
-10 -5 -11 5 

AAT TCC ATT GGC AAC CAC GAT GGC TAT GAT TAT GAA TTT TGG AAA GAT 144 
Asn Ser lie Gly Asn His Asp Gly Tyr Asp Tyr Glu Phe Trp Lys Asp 
10 15 20 



AGC GGT GGC TCT GGG ACA ATG ATT CTC AAT CAT GGC GGT ACG TTC AGT1 
Ser Gly Gly Ser Gly Thr Met He Leu Asn His Gly Gly Thr Phe Ser 
25 30 35 



92 
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GCC CAA TGG AAC AAT GTT AAC AAC ATA TTA TTC CGT AAA GGT AAA AAA 24 C 

Ala Gin Trp Asn Asn Val Asn Asn lie Leu Phe Arg Lys Gly Lys Lys 
40 45 50 

TTC AAT GAA ACA CAA ACA CAC CAA CAA GTT GGT AAC ATG TCC ATA AAC 288 
Phe Asn Glu Thr Gin Thr His Gin Gin Val Gly Asn Met Ser lie Asn 
55 60 65 

TAC GGA GCC AAC TTC CAA CCA AAT GGT AAT GCG TAT TTA TGC GTC TAT 33 6 

Tyr Gly Ala Asn Phe Gin Pro Asn Gly Asn Ala Tyr Leu Cys Val Tyr 
70 75 SC 85 

GGT TGG ACT GTT GAC CCT CTT GTC GAA TAT TAT ATT GTC GAC AGT TGG 384 
Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr He Val Asp Ser Trp 
90 95 100 

GGC AAC TGG CGT CCA CCA GGA GCA ACG CCT AAG GGG ACC ATC ACT GTT 432 
Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys Gly Thr He Thr Val 
105 110 115 

GAT GGA GGA ACA TAT GAT ATC TAC GAG ACT CTT AGA GTC AAT CAA CCC 480 
Asp Gly Gly Thr Tyr Asp He Tyr Glu Thr Leu Arg Val Asn Gin Pro- 
120 125 130 

TCC ATT AAG GGG ATT GCC ACA TTT AAA CAA TAT TGG AGT GTT CGA AGA 528 
Ser He Lys Gly He Ala Thr Phe Lys Gin Tyr Trp Ser Val Arg Arg 
135 140 145 

TCG AAA CGC ACG AGT GGC ACG ATT TCT GTC AGC AAC CAC TTT AGA GCG 576 
Ser Lys Arg Thr Ser Gly Thr He Ser Val Ser Asn His Phe Arg Ala 
150 155 160 165 



TGG GAA AAC TTA GGG ATG AAT ATG GGG AAA ATG TAT GAA GTC GCG CTT 
Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met Tyr Glu Val Ala Leu 
170 175 180 



624 
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ACT GTA GAA GGC TAT CAA AGT AGC GGA AGT GCT AAT GTA TAT AGC AAT 672 
Thr Val Glu Gly Tyr Gin Ser Ser Gly Ser Ala Asn Val Tyr Ser Asn 
185 190 195 

ACA CTA AGA ATT AAC GGT AAC CCT CTC TCA ACT ATT AGT AAT GAC GAG 720 
Thr Leu Arg lie Asn Gly Asn Pro Leu Ser Thr He Ser Asn Asp Glu 
200 205 210 



AGC ATA ACT TTG GAT AAA AAC AAT 
Ser He Thr Leu Asp Lys Asn Asn 
215 220 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal fragment 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus 
10 (B) SOURCE: 720/1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Arg Gin Lys Lys Leu Thr Leu 
-25 -20 

Ala Leu Thr Leu Pro Ala Glu He 



He Leu Ala Phe Leu Val Cys Phe 
-15 

He Gin Ala Gin He Val Thr Asp 
-11 5 
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Asn Ser He Gly Asn His Asp Gly Tyr Asp Tyr Glu Phe Trp Lys Asp 
10 15 20 

Ser Gly Gly Ser Gly Thr Met He Leu Asn His Gly Gly Thr Phe Ser 
25 30 35 

Ala Gin Trp Asn Asn Val Asn Asn He Leu Phe Arg Lys Gly Lys Lys 
40 45 50 

Phe Asn Glu Thr Gin Thr His Gin Gin Val Gly Asn Met Ser He Asn 
55 60 65 

Tyr Gly Ala Asn Phe Gin Pro Asn Gly Asn Ala Tyr Leu Cys Val Tyr 
70 75 80 85 

Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr He Val Asp Ser Trp 
90 95 100 

Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys Gly Thr He Thr Val 
105 110 115 

Asp Gly Gly Thr Tyr Asp He Tyr Glu Thr Leu Arg Val Asn Gin Pro 
120 125 130 

Ser He Lys Gly He Ala Thr Phe Lys Gin Tyr Trp Ser Val Arg Arg 
135 140 145 

Ser Lys Arg Thr Ser Gly Thr He Ser Val Ser Asn His Phe Arg Ala 
150 155 160 165 

Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met Tyr Glu Val Ala Leu 
170 175 180 



Thr Val Glu Gly Tyr Gin Ser Ser Gly Ser Ala Asn Val Tyr Ser Asn 
185 190 195 
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Thr Leu Arg lie Asn Gly Asn Pro Leu Ser Thr lie Ser Asn Asp Glu 
200 205 210 

Ser lie Thr Leu Asp Lys Asn Asn 
215 220 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 



ATGAGACAAA AGAAATTGAC GTTGATTTTA GCCTTTTTAG TTTGTTTTGC ACTAACCTTA 

81 

CCTGCAGAAA TAATTCAGGC A 



(2) INFORMATION FOR SEQ ID NO: 8: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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ATG AGA CAA AAG AAA TTG ACG TTG ATT TTA GCC TTT TTA GTT TGT TTT 4 8 

Met Arg Gin Lys Lys Leu Thr Leu lie Leu Ala Phe Leu Val Cys Phe 
15 10 15 

GCA CTA ACC TTA CCT GCA GAA ATA ATT CAG GCA Si 
Ala Leu Thr Leu Pro Ala Glu lie lie Gin Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus 
10 (B) SOURCE: 720/1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Arg Gin Lys Lys Leu Thr Leu He Leu Ala Phe Leu Val Cys Phe 
15 10 15 

Ala Leu Thr Leu Pro Ala Glu lie He Gin Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1513 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



AAATTGAATT GTGTATATCT AATGATAACG ACAAATCGTC ACTGTTTTTA AACTAATCTC 6 0 

AAACCAATAC TTCTTTATTT AACGCTAACC ACTTGCAATC TTATCACAAG AACATTCTTT 120 

ATAGGAACTT TCCCATTTGC AAGACGATAA AAAATCTTTT TCCCCTATTT TATCTTATCG 180 

CCTTGATCGG TTTAATTTGT AAACTTTATT TTAGTTTACG TGATGTTCCC TCATTCATAC 24 0 

CATTAATCAC AGTTAACGCT AGAGTCATCT TTTTTCGGTT CTCAAAAATA CCTGAAGAAC 3 00 

ATTTATGTCA TATTTTCTCA CGCCGCTCCA TAATGGAATA TATATACTCT TTTATACATA 360 

TTAAGTAAAT TAGTATATAC TTGCGTTATC AAAATGTGAG ATAATCTAAT TGATCAAACA 42 0 

AGCAGCTATC CAAAAAACAC TGATGTTGAC CTCTTAAAGA AGTGTCACTA TCTATGAAAA 4 80 

GATAATTATC CAGTTTCAAA ATTTGAAATA GTGTGTATGG AATAGTTTGA ATGTCAACTG 540 

CTGTGAAAGG AGGGTAGGTA GTACCGTAGA CTTCATTACC AAAAATTAGT TGTAAAAAAA 600 

TTAAAAGGAG GAATGC CTAA TGAGACAAAA GAAATTGACG TTGATTTTAG CCTTTTTAGT 6 60 

TTGTTTTGCA CTAACCTTAC CTGCAGAAAT AATTCAGGCA CAAATCGTCA CCGACAATTC 720 

CATTGGCAAC CACGATGGCT ATGATTATGA ATTTTGGAAA GATAGCGGTG GCTCTGGGAC 78 0 

A^TGATTCTC AATCATGGCG GTACGTTCAG TGCCCAATGG AACAATGTTA ACAACATATT 840 

ATTC CGTAAA GGTAAAAAAT TCAATGAAAC ACAAACACAC CAACAAGTTG GTAACATGTC 9 00 

CATAAACTAC GGAGCCAACT TCCAACCAAA TGGTAATGCG TATTTATGCG TCTATGGTTG 9€0 



GACTGTTGAC CCTCTTGTCG AATATTATAT TGTCGACAGT TGGGGCAACT GGCGTCCACC 
AGGAG CAACG CCTAAGGGGA CCATCACTGT TGATGGAGGA ACATATGATA TCTACGAGAC 
TCTTAGAGTC AATCAACCCT CCATTAAGGG GATTGCCACA TTTAAACAAT ATTGGAGTGT 



- 64 - 

TCGAAGATCG AAACGCACGA GTGGCACGAT TTCTGTCAGC AACCACTTTA GAGCGTGGGA 12 00 

AAACTTAGGG ATGAATATGG GGAAAATGTA TGAAGTCGCG CTTACTGTAG AAGGCTATCA 1260 

AAGTAGCGGA AGTGCTAATG TATATAGCAA TACACTAAGA ATTAACGGTA ACCCTCTCTC 132 C 

AACTATTAGT AATGACGAGA GCATAACTTT GGATAAAAAC AATTAAAAAT CCTTATCTCT 13 SO 

TTCGGTTCAG TTCTCATTAT TTTCAAATAA CCTCCCGGTT GGATCTTTTC CAACGGGAGG 1440 

TTTTATTGGA AAGGTTAAGT ATAGTATACT CCGATTC CAT CCAGAGGAAT GCTTGAAACA 1500 

CCTCCGTCAC TAG 1513 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1513 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



AAATTGAATT GTGTATATCT AATGATAACG ACAAATCGTC ACTGTTTTTA 



60 
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AAAC CAAT AC TT CTTTATTT AACGCTAACC ACTTGCAATC TTATCACAAG AACATTCTTT 12 0 

ATAGGAACTT TCCCATTTGC AAGACGATAA AAAATCTTTT TCCCCTATTT TATCTTATCG 180 

CCTTGATCGG TTTAATTTGT AAACTTTATT TTAGTTTACG TGATGTTCCC TCATTCATAC 24 0 

CATTAATCAC AGTTAACGCT AGAGTCATCT TTTTTCGGTT CTCAAAAATA CCTGAAGAAC 30 0 

ATTTATGTCA TATTTTCTCA CGCCGCTCCA TAATGGAATA TATATACTCT TTTATACATA 360 

TTAAGTAAAT TAGTATATAC TTGCGTTATC AAAATGTGAG ATAATCTAAT TGATCAAACA 420 

AGCAGCTATC CAAAAAACAC TGATGTTGAC CTCTTAAAGA AGTGTCACTA TCTATGAAAA 4 80 

GATAATTATC CAGTTTCAAA ATTTGAAATA GTGTGTATGG AATAGTTTGA ATGTCAACTG 540 

CTGTGAAAGG AGGGTAGGTA GTACCGTAGA CTTCATTACC AAAAATTAGT TGTAAAAAAA 600 

TTAAAAGGAG GAATGCCTA ATG AGA CAA AAG AAA TTG ACG TTG ATT TTA GCC 652 

Met Arg Gin Lys Lys lieu Thr Leu lie Leu Ala 

-25 -20 

TTT TTA GTT TGT TTT GCA CTA ACC TTA CCT GCA GAA ATA ATT CAG GCA 700 
Phe Leu Val Cys Phe Ala Leu Thr Leu Pro Ala Glu lie lie Gin Ala 
-15 -10 -5 -1 

CAA ATC GTC ACC GAC AAT TCC ATT GGC AAC CAC GAT GGC TAT GAT TAT 74 8 

Gin lie Val Thr Asp Asn Ser lie Gly Asn His Asp Gly Tyr Asp Tyr 
15 10 15 

GAA TTT TGG AAA GAT AGC GGT GGC TCT GGG ACA ATG ATT CTC AAT CAT 796 
Glu Phe Trp Lys Asp Ser Gly Gly Ser Gly Thr Met lie Leu Asn His 
20 25 30 
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GGC GGT ACG TTC AGT GCC CAA TGG AAC AAT GTT AAC AAC ATA TTA TTC 844 
Gly Gly Thr Phe Ser Ala Gin Trp Asn Asn Val Asn Asn lie Leu Phe 
• 35 40 45 

CGT AAA GGT AAA AAA TTC AAT GAA ACA CAA ACA CAC CAA CAA GTT GGT 892 
Arg Lys Gly Lys Lys Phe Asn Glu Thr Gin Thr His Gin Gin Val Gly 
50 55 60 

AAC ATG TCC ATA AAC TAC GGA GCC AAC TTC CAA CCA AAT GGT AAT GCG 94 C 

Asn Met Ser lie Asn Tyr Gly Ala Asn Phe Gin Pro Asn Gly Asn Ala 
65 70 75 80 

TAT TTA TGC GTC TAT GGT TGG ACT GTT GAC CCT CTT GTC GAA TAT TAT 988 
Tyr Leu Cys Val Tyr Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr 
85 90 95 

ATT GTC GAC AGT TGG GGC AAC TGG CGT CCA CCA GGA GCA ACG CCT AAG 103 6 

lie Val Asp Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys 
100 105 110 

GGG ACC ATC ACT GTT GAT GGA GGA ACA TAT GAT ATC TAC GAG ACT CTT 1084 
Gly Thr lie Thr Val Asp Gly Gly Thr Tyr Asp He Tyr Glu Thr Leu 
115 120 125 

AGA GTC AAT CAA CCC TCC ATT AAG GGG ATT GCC ACA TTT AAA CAA TAT 1132 
Arg Val Asn Gin Pro Ser He Lys Gly He Ala Thr Phe Lys Gin Tyr 
130 135 140 

TGG AGT GTT CGA AGA TCG AAA CGC ACG AGT GGC ACG ATT TCT GTC AGC 1180 
Trp Ser Val Arg Arg Ser Lys Arg Thr Ser Gly Thr He Ser Val Ser 
145 150 155 160 



AAC CAC TTT AGA GCG TGG GAA AAC TTA GGG ATG AAT ATG GGG AAA ATG 
Asn His Phe Arg Ala Trp Glu Asn Leu Gly Met Asn Met Gly Lys Met 
165 170 175 



1228 
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TAT GAA GTC GCG CTT ACT GTA GAA GGC TAT CAA AGT AGC GGA AGT GCT 127 6 
Tyr Glu Val Ala Leu Thr Val Glu Gly Tyr Gin Ser Ser Gly Ser Ala 
180 185 190 

AAT GTA TAT AGC AAT ACA CTA AGA ATT AAC GGT AAC CCT CTC TCA ACT 1324 
Asn Val Tyr Ser Asn Thr Leu Arg lie Asn Gly Asn Pro Leu Ser Thr 
195 200 205 

ATT AGT AAT GAC GAG AGC ATA ACT TTG GAT AAA AAC AAT TAAAAATCCT 13 73 
lie Ser Asn Asp Glu Ser lie Thr Leu Asp Lys Asn Asn 
210 215 220 

TATCTCTTTC GGTTCAGTTC TCATTATTTT CAAATAACCT CCCGGTTGGA TCTTTTCCAA 14 33 

CGGGAGGTTT TATTGGAAAG GTTAAGTATA GTATACTC CG ATTCCATCCA GAGGAATGCT 1493 

TGAAACACCT CCGTCACTAG 1513 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAATTGAATT GTGTATATCT AATGATAACG ACAAATCGTC ACTGTTTTTA AACTAATCTC 
AAACCAATAC TTCTTTATTT AACGCTAACC ACTTGCAATC TTATCACAAG AACATTCTTT 
ATAGGAACTT TCCCATTTGC AAGACGATAA AAAATCTTTT TCCCCTATTT TATCTTATCG 



60 



120 



180 
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CCTTGATCGG TTTAATTTGT AAACTTTATT TTAGTTTACG TGATGTTCCC T CATT CAT AC 24 0 

CATTAATCAC AGTTAACGCT AGAGTCATCT TTTTTCGGTT CTCAAAAATA CCTGAAGAAC 3 00 

ATTTATGTCA TATTTTCTCA CGCCGCTCCA TAATGGAATA TATATACTCT TTTATACATA 3 60 

TTAAGTAAAT TAGTATATAC TTGCGTTATC AAAATGTGAG ATAATCTAAT TG AT CAAACA 42 0 

AGCAGCTATC CAAAAAACAC TGATGTTGAC CTCTTAAAGA AGTGTCACTA TCTATGAAAA 4 80 

GATAATTATC CAGTTTCAAA ATTTGAAATA GTGTGTATGG AATAGTTTGA ATGTCAACTG 54 0 

CTGTGAAAGG AGGGTAGGTA GTACCGTAGA CTTCATTACC AAAAATTAGT TGTAAAAAAA 600 

TTAAAAGGAG GAATGCCTA 6 IS 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



TAAAAATCCT TATCTCTTTC GGTTCAGTTC TCATTATTTT CAAATAACCT C CCGGTTGGA 60 
TCTTTTCCAA CGGGAGGTTT TATTGGAAAG GTTAAGTATA GTATACTCCG ATTCCATCCA 120 
GAGGAATGCT TGAAACACCT CCGTCACTAG 150 



(2) INFORMATION FOR SEQ ID NO : 14 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid (synthetic oligo 
nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CCCCCCTACG TAGCGGCCGC CCCGGCCGGT AACCTAGGAA GTCAGCGCCC TGCACC 
(2) INFORMATION FOR SEQ ID NO: 15: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

15 (ii) MOLECULE TYPE: nucleic acid (synthetic oligo 

nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCCCCCTACG TAGGCCGGGG CGGCCGCGGT TACCTAGGGC CTCGTGATAC GCCTAT 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 
nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

ACGAGGAAAG ATGCTGTTCT TGTAAATGAG T 



(2) INFORMATION FOR SEQ ID NO: 17: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 

nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TACCTTGTCT ACAAACCCC j 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 
20 nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CGGTCGCCGC ATACACTA 



18 
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(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 
nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CCCCCCCCCG GTAACCTGCA TTAATGAATC GGCCAA 

10 (2) INFORMATION FOR SEQ ID NO: 20: 



(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 
nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CCCCCCCCCG GTTACCGTAT TTATTAACTT CTCCTAGTA 

(2) INFORMATION FOR SEQ ID NO: 21: 
20 (i) SEQUENCE CHARACTERISTICS: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 



15 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



* 
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(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 
nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CCCCCCTCTA GATTAATTAA CCAAGCTTGG GATCCGTCGA CCTGCAGATC 

(2) INFORMATION FOR SEQ ID NO: 22: 
5 (i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 



nucleotide) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



CCCCCCTGAA ATCAGCTGGA CTAAAAGGGA TGCAATTTC 



39 



(2) INFORMATION FOR SEQ ID NO:23: 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 



20 



nucleotide) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



CCCCCCGTCG ACCGCATGCG CCGGCACAGC 



30 



- 73 - 



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 



5 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 
nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

CCCCCCGCAT GCGCAAATCG TCACCGACAA TTCCATTGG 
10 (2) INFORMATION FOR SEQ ID NO: 25: 



(ii) MOLECULE TYPE: nucleic acid (synthetic oligo- 
nucleotide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TACCTTGTCT ACAAACCCC 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 



15 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



19 



(2) INFORMATION FOR SEQ ID NO: 26: 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TCATGTAACT CGCCTTGATC TATTTCATTT GTATCAAAGG ATTTATACAC AAACAAGAGA 
CATCCATGCC GGGTTAAAGC AGTATCGTTC CATCTAACAG AGAAGGNCTG CATGAAAGGA 
GGTGATGGGT TTTTCATCTT AGGGATGACA GAACAATACG GATGAAAAAA GGAGAGGGAT 
GGAAA 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

10 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

ATGAATTTGA AAAGATTGAG GCTGTTGTTT GTGATGTGTA TTGGATTTGT GCTGACACTG 6 0 

ACGGCTGTGC CGGCTCATGC G 81 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 
!5 (C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ATG AAT TTG AAA AGA TTG AGG CTG TTG TTT GTG ATG TGT ATT GGA TTT 
Met Asn Leu Lys Arg Leu Arg Leu Leu Phe Val Met Cys He Gly Phe 
x s 10 " 



4S 
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GTG CTG ACA CTG ACG GCT GTG CCG GCT CAT GCG 



81 



Val Leu Thr Leu Thr Ala Val Pro Ala His Ala 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Asn Leu Lys Arg Leu Arg Leu Leu Phe Val Met Cys lie Gly Phe 



1 



5 



10 



15 



Val Leu Thr Leu Thr Ala Val Pro Ala His Ala 



20 



25 



